An Effective Sentence Ordering Approach For Multi-Document Summarization Using Text Entailment
Authors
Abstract
With the rapid development of modern technology, the amount of electronically available textual information has grown considerably. Manually summarizing information from unstructured text sources creates overhead for the user, so a systematic approach is required. Summarization provides the user with a condensed version of the original text, but real-time applications require extended summarization that draws on multiple documents. The main focus of multi-document summarization is sentence ordering and ranking, which arranges the sentences collected from multiple documents to generate a well-organized summary. An improper order of extracted sentences significantly degrades the readability and understandability of the summary. The existing system performs multi-document summarization by combining several preference experts, such as chronology, probabilistic, precedence, succession, and topical closeness, to calculate the preference value between sentences. These approaches to sentence ordering and ranking do not address a context-based similarity measure between sentences, which is essential for effective summarization. The proposed system addresses this issue through a textual entailment expert system. This approach builds an entailment model that incorporates cause and effect between sentences in the documents using a symmetric measure such as cosine similarity and non-symmetric measures such as unigram match, bigram match, longest common subsequence, skip-gram match, and stemming. The proposed system is efficient in providing the user with a contextual summary, which significantly improves the readability and understandability of the final coherent summary.
Keywords: text summarization; preference experts; sentence ranking; sentence ordering; text entailment.
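The measures named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the whitespace tokenizer, the normalization by hypothesis length, and the unweighted average in `entailment_score` are all assumptions made for illustration, and stemming is left out to keep the example dependency-free.

```python
import math
from collections import Counter


def tokens(sentence):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    stripped = (w.strip(".,;:!?\"'()") for w in sentence.lower().split())
    return [w for w in stripped if w]


def cosine_similarity(a, b):
    """Symmetric measure: cosine between term-count vectors."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[w] * cb[w] for w in ca)
    norm = math.sqrt(sum(v * v for v in ca.values())) * math.sqrt(
        sum(v * v for v in cb.values())
    )
    return dot / norm if norm else 0.0


def ngrams(seq, n):
    """All contiguous n-grams of a token list."""
    return [tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)]


def ngram_match(text, hyp, n):
    """Non-symmetric: fraction of hypothesis n-grams that occur in the text."""
    hyp_grams = ngrams(hyp, n)
    if not hyp_grams:
        return 0.0
    text_grams = set(ngrams(text, n))
    return sum(g in text_grams for g in hyp_grams) / len(hyp_grams)


def lcs_match(text, hyp):
    """Non-symmetric: longest common subsequence length over hypothesis length."""
    if not hyp:
        return 0.0
    prev = [0] * (len(hyp) + 1)
    for t in text:
        cur = [0]
        for j, h in enumerate(hyp, start=1):
            cur.append(prev[j - 1] + 1 if t == h else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1] / len(hyp)


def skip_bigrams(seq):
    """All ordered token pairs, allowing any gap between the two tokens."""
    return {(seq[i], seq[j]) for i in range(len(seq)) for j in range(i + 1, len(seq))}


def skip_gram_match(text, hyp):
    """Non-symmetric: fraction of hypothesis skip-bigrams found in the text."""
    hyp_pairs = skip_bigrams(hyp)
    if not hyp_pairs:
        return 0.0
    return len(hyp_pairs & skip_bigrams(text)) / len(hyp_pairs)


def entailment_score(text_sentence, hyp_sentence):
    """Unweighted mean of the five measures (the equal weighting is an assumption)."""
    t, h = tokens(text_sentence), tokens(hyp_sentence)
    scores = [
        cosine_similarity(t, h),
        ngram_match(t, h, 1),
        ngram_match(t, h, 2),
        lcs_match(t, h),
        skip_gram_match(t, h),
    ]
    return sum(scores) / len(scores)
```

Because the n-gram, subsequence, and skip-gram scores are normalized by the hypothesis alone, `entailment_score(a, b)` generally differs from `entailment_score(b, a)`. A directional score of this kind is what lets an ordering component prefer placing one sentence before another.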
Similar resources
A preference learning approach to sentence ordering for multi-document summarization
Ordering information is a difficult but important task for applications generating natural-language texts such as multi-document summarization, question answering, and concept-to-text generation. In multi-document summarization, information is selected from a set of source documents. Therefore, the optimal ordering of those selected pieces of information to create a coherent summary is not obv...
Improving Coherence in Multi-document Summarization through Proper Ordering of Sentences
The problem of extracting salient information to include in a summary has been researched extensively in the field of automatic text summarization. However, coherent arrangement of the extracted information has received little attention. Specifically, in the case of extractive multi-document text summarization, sentences that convey important information are selected from a set of documents. There...
Sentence Clustering-based Summarization of Multiple Text Documents
With the rapid growth of the World Wide Web, information overload is becoming a problem for an increasingly large number of people. Automatic Multidocument summarization can be an indispensable solution to reduce the information overload problem on the web. This kind of summarization facility helps users to see at a glance what a collection is about and provides a new way of managing a vast hoa...
A Bottom-Up Approach to Sentence Ordering for Multi-Document Summarization
Ordering information is a difficult but important task for applications generating natural-language text. We present a bottom-up approach to arranging sentences extracted for multi-document summarization. To capture the association and order of two textual segments (e.g., sentences), we define four criteria: chronology, topical-closeness, precedence, and succession. These criteria are integrated ...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Publication date: 2014